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DETAILED ACTION 

1. This action is responsive to communications: Request for Reconsideration (hereinafter the Request) 
filed 1/26/2007, to the original application filed 8/14/2000, with provisional filing date 4/6/2000. 

2. Claims 1-3, 5, 7-14, 16-24, 26 pending. Claims 1, 10, 12, 20, 22, 23, 26 are independent claims. 

Claim Rejections - 35 USC §103 

3. The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all obviousness rejections set 
forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set forth in section 102 of this title, if the 
differences between the subject matter sought to be patented and the prior art are such that the subject matter as a whole would have been 
obvious at the time the invention was made to a person having ordinary skill in the art to which said subject matter pertains. Patentability shall 
not be negatived by the manner in which the invention was made. 

4. Claims 1-3, 5, 7-14, 16-21 are rejected under 35 U.S.C. 103(a) as being unpatentable over Najork 
et al. (hereinafter "Najork '265"), US 6,321,265 Bl filed 11/2/1999 in view of Hoffert et al. (hereinafter 
Hoffert), US 6,374,260 Bl filed 2/28/2000. 

Regarding independent claim 1, Najork '265 teaches sending a request for additional 
links to hyperlinked documents to a link manager in fig. 2-4 and col. 5 line 53 - col. 6 line 6. 
The Frontier data structure queues organize and manage the links to hyperlinked documents and 
act as a link manager for the web crawler. The Frontier data structure provides links to the web 
crawler. Najork '265 teaches receiving a plurality of links to hyperlinked documents 
to be crawled in col. 1 lines 31-47 and col. 3 lines 3-52. Najork '265 teaches grouping the 
plurality of links to hyperlinked documents by host in fig. 7 and col. 2 lines 24-36. Najork '265 
teaches selecting a host to crawl next according to a stall time of the host in fig. 6, col. 1 line 60 
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- col. 2 line 2, col. 2 lines 37-62, and col. 3 lines 23-39. Najork '265 teaches crawling a 
hyperlinked document from the selected host in col. 1 lines 31-47 and col. 3 lines 3-52. 

Najork '265 does not teach that the plurality of links to be crawled are selected by the 
link manager based on priority. However, Hoffert teaches that the plurality of links (pages) to be crawled 
are selected based on priority (Hoffert col. 2 lines 30-36, col. 4 line 12-18). It would have been obvious to 
one of ordinary skill in the art at the time the invention was made to have combined Hoffert into 
Najork '265 to have created the claimed invention. It would have been obvious and desirable to have 
implemented a prioritized selection of the links so that media rich content or rapidly changing would have 
higher priority and crawled first, facilitating quicker indexing (Hoffert column 4 lines 15-17). 

Regarding dependent claim 2, Najork '265 teaches wherein the stall time of the host is 
the earliest time in which a hyperlinked document from the host should be crawled in fig. 6 col. 
1 line 60 -col. 2 line 2 5 col. 2 lines 37-62, and col. 3 lines 23-39. 

Regarding dependent claim 3, Najork '265 teaches selecting a host with a stall time that 
is earlier than the current time in fig. 6, col. 1 line 60 - col. 2 line 2, col. 2 lines 37-62, and col. 3 
lines 23-39. 

Regarding dependent claim 5, Najork '265 teaches examining hyperlinked documents 
to be crawled at each host until a host is found with a stall time that is earlier than the current 
time in fig. 5-7, col. 1 line 3 1 - col. 2 line 2, and col. 2 lines 37-62. Najork '265 does not teach 
examining the groups in the specific "descending" order of the humber of hyperlinked documents 
to be crawled. It would have been obvious to one of ordinary skill in the art at the time of the 
invention to have modified Najork '265 to have taught this limitation. It would have been 
obvious and desirable to have examined the hosts in a descending order of hyperlinked 
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documents to be crawled so that the hosts were examined in a logical order. This 

would have made the link selection process for the web crawler straightforward and easy to 

program. 

Regarding dependent claim 7, Najork '265 does not teach moving the selected host to a 
group with one less hyperlinked documents to be crawled. It would have been obvious to one of 
ordinary skill in the art at the time the invention was made to have modified Najork '265 so that 
the a selected host would have been moved to a group with one less hyperlinked documents to be 
crawled. It would have been obvious and desirable to have done this so that the groups would 
have remained balanced for the web crawler. 

Regarding dependent claim 8, Najork '265 teaches determining a retrieval time for 
retrieving the hyperlinked document from the selected host in col. 2 lines 43-52. 

Regarding dependent claim 9, Najork '265 teaches adjusting subsequent stall times for 
the selected host according to the retrieval times in col. 2 lines 43-52. 

Regarding independent claim 10, Najork '265 teaches computer code that requests 
links from a link manager in fig. 2-4 and col. 5 line 53 - col. 6 line 6. The Frontier data structure 
queues organize and manage the links to hyperlinked documents and act as a link manager for 
the web crawler. The Frontier data structure provides links to the web crawler. Najork '265 
teaches receiving a plurality of links to hyperlinked documents to be crawled in col. 1 lines 31- 
47 and col. 3 lines 3-52. Najork 765 teaches grouping the plurality of links to hyperlinked 
documents by host in fig. 7 and col. 2 lines 24-36. Najork '265 teaches selecting a host to crawl 
next according to a stall time of the host in fig. 6, col. 1 line 60 - col. 2 line 2, col. 2 lines 37-62, 
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and col. 3 lines 23-39. Najork '265 teaches crawling a hyperlinked document from the selected 
host in col. 1 lines 31-47 and col. 3 lines 3-52. Najork '265 teaches a computer readable medium 
that stores computer codes in fig. 1 and col. 1 lines 13-30. 

Najork '265 does not teach that the plurality of links to be crawled are selected by the 
link manager based on priority. However, Hoffert teaches that the plurality of links (pages) to be crawled 
are selected based on priority (Hoffert col. 2 lines 30-36, col. 4 line 12-18). It would have been obvious to 
one of ordinary skill in the art at the time the invention was made to have combined Hoffert into 
Najork '265 to have created the claimed invention. It would have been obvious and desirable to have 
implemented a prioritized selection of the links so that media rich content or rapidly changing would have 
higher priority and crawled first, facilitating quicker indexing (Hoffert column 4 lines 15-17). 

Regarding dependent claim 11, Najork '265 teaches a computer readable medium 
which is a CD-ROM, floppy disk, tape, flash memory, system memory, hard drive in fig. 1 and col. 1 lines 
13-30. 

Regarding independent claim 12, Najork '265 teaches sending a request for additional 
links to hyperlinked documents to a link manager in fig. 2-4 and col. 5 line 53 - col. 6 line 6. 
The Frontier data structure queues organize and manage the links to hyperlinked documents and 
act as a link manager for the web crawler. The Frontier data structure provides links to the web 
crawler. Najork '265 teaches receiving a plurality of links to hyperlinked documents to be 
crawled in col. 1 lines 31-47 and col. 3 lines 3-52. Najork '265 teaches grouping the plurality of 
links to hyperlinked documents by host in fig. 7 and col. 2 lines 24-36. Najork '265 teaches 
selecting a host to crawl next according to a stall time of the host in fig. 6, col. 1 line 60 - col. 2 
line 2, col. 2 lines 37-62, and col. 3 lines 23-39. Najork '265 teaches crawling a hyperlinked 
document from the selected host in col. 1 lines 31-47 and col. 3 lines 3-52. Najork '265 teaches 
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determining a retrieval time for retrieving the hyperlinked document from the selected host in 
fig. 6, col. 1 line 60 - col. 2 line 2, col. 2 lines 37-62, and col. 3 lines 23-39. Najork '265 teaches 
adjusting subsequent stall times for the selected host according to the retrieval time in fig. 6, col. 
1 line 60 - col. 2 line 2, col. 2 lines 37-62, and col. 3 lines 23-39. 

Najork '265 does not teach that the plurality of links to be crawled are selected by the 
link manager based on priority. However, Hoffert teaches that the plurality of links (pages) to be crawled 
are selected based on priority (Hoffert col. 2 lines 30-36, col. 4 line 12-18). It would have been obvious to 
one of ordinary skill in the art at the time the invention was made to have combined Hoffert into 
Najork '265 to have created the claimed invention. It would have been obvious and desirable to have 
implemented a prioritized selection of the links so that media rich content or rapidly changing would have 
. higher priority and crawled first, facilitating quicker indexing (Hoffert column 4 lines 15-17). 

Regarding dependent claim 13, Najork '265 teaches wherein the stall time of the host is 
the earliest time in which a hyperlinked document from the host should be crawled in fig. 6, col. 
1 line 60 - col. 2 line 2, col. 2 lines 37-62, and col. 3 lines 23-39. 

Regarding dependent claim 14, Najork '265 teaches selecting a host with a stall time 
that is earlier than the current time in fig. 6, col. 1 line 60 - col. 2 line 2, col. 2 lines 37-62, and 
col. 3 lines 23-39. 

Regarding dependent claim 16, Najork '265 teaches examining the groups in descending order 
of the number of hyperlinked documents to be crawled at each host until a host is found with a stall 
time that is earlier than the current time in fig. 5-7, col. 1 line 3 1 - col. 2 line 2, and col. 2 lines 37-62. 

Regarding dependent claim 17, Najork '265 teaches sorting the hosts in fig. 6-7 and 
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col. 1 line 60 col. 2 line 2 and col. 2 lines 37-62. Najork '265 does not specifically teach 
sorting the hosts by stall time. It would have been obvious to one of ordinary skill in the art at 
the time the invention was made to have modified Najork '265 to have sorted the hosts according 
to stall time. It would have been obvious and desirable to have done this so that the web crawler 
could have crawled the sites in a time-efficient order. 

Regarding dependent claim 18, Najork '265 does not teach moving the selected host to 
a group with one less hyperlinked documents to be crawled. It would have been obvious to one 
of ordinary skill in the art at the time the invention was made to have modified Najork '265 so 
that the a selected host would have been moved to a group with one less hyperlinked documents 
to be crawled. It would have been obvious and desirable to have done this so that the groups 
would have remained balanced for the web crawler. 

Regarding dependent claim 19, Najork '265 teaches determining a retrieval time for 
retrieving the hyperlinked document from a selected host in col. 3 lines 23-29. 

Regarding independent claim 20, Najork '265 teaches sending a request for additional 
links to hyperlinked documents to a link manager in fig. 2-4 and col. 5 line 53 - col. 6 line 6. 
The Frontier data structure queues organize and manage the links to hyperlinked documents and 
act as a link manager for the web crawler. The Frontier data structure provides links to the web 
crawler. Najork '265 teaches receiving a plurality of links to hyperlinked documents 
to be crawled in col. 1 lines 31-47 and col. 3 lines 3-52. Najork '265 teaches grouping the 
plurality of links to hyperlinked documents by host in fig. 7 and col. 2 lines 24-36. 

Najork *265 teaches selecting a host to crawl next according to a stall time of the host in 
fig. 6, col. 1 line 60 - col. 2 line 2, col. 2 lines 37-62, and col. 3 lines 23-39. Najork '265 teaches 
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crawling a hyperlinked document from the selected host in col. 1 lines 3 1-47 and col. 3 lines 3- 
52. Najork '265 teaches determining a retrieval time for retrieving the hyperlinked document 
from the selected host in fig, 6, col. 1 line 60 - col. 2 line 2, col. 2 lines 37-62, and col. 3 lines 
23-39. Najork '265 teaches adjusting subsequent stall times for the selected host according to 
the retrieval time in fig. 6, col. 1 line 60 -col. 2 line 2, col. 2 lines 37-62, and col. 3 lines23-39. 
Najork '265 teaches a computer readable medium that stores computer codes in fig. 1 and col. 1 
lines 13-30. 

Najork '265 does not teach that the plurality of links to be crawled are selected by the 
link manager based on priority. However, Hoffert teaches that the plurality of links (pages) to be crawled 
are selected based on priority (Hoffert col. 2 lines 30-36, col. 4 line 12-18). It would have been obvious to 
one of ordinary skill in the art at the time the invention was made to have combined Hoffert into 
Najork '265 to have created the claimed invention. It would have been obvious and desirable to have 
implemented a prioritized selection of the links so that media rich content or rapidly changing would have 
higher priority and crawled first, facilitating quicker indexing (Hoffert column 4 lines 15-17). 

Regarding dependent claim 21, Najork '265 teaches a computer readable medium 
which is a CD-ROM, floppy disk, tape, flash memory, system memory, hard drive in fig. 1 and col. 1 lines 
13-30. 
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5. Claims 22-24 are rejected under 35 U.S.C. 103(a) as being unpatentable over Najork et al. 
(hereinafter "Najork '265"), US 6,321,265 Bl filed 11/2/1999 in view of Najork et al. (hereinafter "Najork 
'755"), US 6,351,755 Bl filed 11/2/1999. 

Regarding independent claim 22, Najork '265 teaches storing a plurality of links to 
hyperlinked documents to be crawled in col. 1 lines 31-47. Najork '265 teaches receiving 
additional links to hyperlinked documents in fig. 1 and col. 3 lines 3-52. Najork '265 teaches 
selecting a host to crawl next according to a stall time of the host in fig. 6, col. 1 line 60 - col. 2 
line 2 and col. 2 lines 37-62. Najork '265 teaches crawling a hyperlinked document from the 
selected host in col. 1 lines 31-47. 

Najork '265 teaches determining that more links to hyperlinked documents are desired 
and sending requests to multiple link managers for more links to hyperlinked documents in fig. 
2-4 and col. 5 line 53 - col. 6 line r 6. The Frontier data structure queues organize and manage the 
links to hyperlinked documents and act as link managers for the web crawler. Each link queue 
manages links from a specific host to provide to the web crawler and thus is similar to each of 
the claimed link managers. 

Najork '265 does not explicitly teach sending requests to multiple link managers for more 
links to hyperlinked documents. However, Najork £ 755 teaches queues containing URL's from multiple 
server hosts, as well as teaching multiple queues implemented in various scenarios (Najork '755 column 
16 lines 10-26). it would have been obvious to one of ordinary skill in the art at the time the invention was 
made to have combined Najork 755 into Najork '265 and modified the Frontier queuing and indexing 
system of the combination to have operated the queues as individual link managers so that links could have 
still be provided to the web crawler in the event one of the queues experienced an interrupted connection 
with the web crawler. 
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Regarding independent claim 23, Najork '265 teaches storing a plurality of links to 
hyperlinked documents to be crawled in col. 1 lines 31-47. Najork '265 teaches receiving 
additional links to hyperlinked documents in fig. 1 and col. 3 lines 3-52. Najork '265 teaches 
selecting a host to crawl next according to a stall time of the host in fig. 6, col. 1 line 60 - col. 2 
line 2 and col. 2 lines 37-62. Najork '265 teaches crawling a hyperlinked document from the . 
selected host in col. 1 lines 31-47. Najork '265 teaches a computer readable medium that stores 
computer codes in fig. 1 and col. 1 lines 13-30. 

Najork '265 teaches determining that more links to hyperlinked documents are desired 
and sending requests to multiple link managers for more links to hyperlinked documents in fig. 
2-4 and col. 5 line 53 - col. 6 line 6. The Frontier data structure queues organize and manage the 
links to hyperlinked documents and act as link managers for the web crawler. Each link queue 
manages links from a specific host to provide to the web crawler and thus is similar to each of 
the claimed link managers. 

Najork '265 does not explicitly teach sending requests to multiple link managers for more 
links to hyperlinked documents. However, Najork '755 teaches queues containing URL's from multiple 
server hosts, as well as teaching multiple queues implemented in various scenarios (Najork '755 column 
16 lines 10-26). it would have been obvious to one of ordinary skill in the art at the time the invention was 
made to have combined Najork 755 into Najork '265 and modified the Frontier queuing and indexing 
system of the combination to have operated the queues as individual link managers so that links could have 
still be provided to the web crawler in the event one of the queues experienced an interrupted connection 
with the web crawler. 

Regarding dependent claim 24, Najork '265 teaches a computer readable medium which is a CD- 
ROM, floppy disk, tape, flash memory, system memory, hard drive in fig. 1, and col.. 1 lines 
13-30. 
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6. Claim 26 is rejected under 35 U.S.C. 103(a) as being unpatentable over Najork f 265. 

Regarding independent claim 26, Najork '265 teaches sending a request for additional 
links to hyperlinked documents to a link manager in fig. 2-4 and col. 5 line 53 - col. 6 line 6. 
The Frontier data structure queues organize and manage the links to hyperlinked documents and 
act as a link manager for the web crawler. The Frontier data structure provides links to the web 
crawler. Najork '265 teaches receiving a plurality of links to hyperlinked documents 
to be crawled in col. 1 lines 3 1-47 and col. 3 lines 3-52. Najork '265 teaches grouping the 
plurality of links to hyperlinked documents by host in fig. 7 and col. 2 lines 24-36. Najork '265 
teaches selecting a host to crawl next according to a stall time of the host in fig. 6, col. 1 line 60 
- col. 2 line 2, col. 2 lines 37-62, and col. 3 lines 23-39. Najork '265 teaches crawling a 
hyperlinked document from the selected host in col. 1 lines 31-47 and col. 3 lines 3-52. 

Najork '265 teaches selecting a host with a stall time that is earlier than the current time in 
fig. 6, col. 1 line 60 - col. 2 line 2, col. 2 lines 37-62, and col. 3 lines 23-39. 

Najork '265 does not teach examining the groups in the specific "descending" order of the number of 
hyperlinked documents to be crawled. However, it would have been obvious to one of ordinary skill in the 
art at the time of the invention to have modified Najork '265 to have taught this limitation. It would have 
been obvious and desirable to have examined the hosts in an order (such as descending order) of 
hyperlinked documents to be crawled so that the hosts were examined in a logical order. This would have 
made the link selection process for the web crawler straightforward and easy to program. 
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Response to Arguments 

7. Applicant's arguments filed 1/26/2007 have been fully and carefully considered but they are not 
persuasive. 

Applicant argues on pages 2-3 of the Request that NAJORK '265 and HOFFERT et al. do not disclose 
or suggest grouping hosts into buckets according to a number of hyperlinked documents to be crawled at each 
host. Applicant further asserts that the Office Action does not address this feature. The examiner respectfully 
disagrees. Najork '265 teaches a Frontier data structure providing links to a web crawler, as well as teaching 
receiving a plurality of links to hyperlinked documents to be crawled. Najork '265 teaches grouping the 
plurality of links to hyperlinked documents by host. Since "buckets" serve to group items accordingly, grouping 
links by host can be reasonable interpreted as grouping into buckets. Najork '265 also teaches selecting a host to 
crawl next according to a stall time of the host. 

Applicant argues on pages 5-8 of the Request that For .example, NAJORK *265 and NAJORK 755 do 
not disclose or suggest "sending requests to multiple link managers for more links to hyperlinked documents". 
The examiner respectfully disagrees. As noted in the instant rejection, Najork *265 does not explicitly teach 
sending requests to multiple link managers for more links to hyperlinked documents. However, Najork '755 
teaches queues containing URL's from multiple server hosts, as well as teaching multiple queues implemented 
in various scenarios. This teaching is combined with Najork '265 accordingly. Applicant argues on page 7 of the 
Request that Najork £ 755 does not teach the above limitation. The examiner respectfully disagrees, Najork '265 
teaches determining that more links to hyperlinked documents are desired and sending requests to multiple link 
managers for more links to hyperlinked documents in fig. 2-4 and col. 5 line 53 - col. 6 line 6. The Frontier data 
structure queues organize and manage the links to hyperlinked documents and act as link managers for the web 
crawler. Each link queue manages links from a specific host to provide to the web crawler and thus is similar to 
each of the claimed link managers. The examiner relies upon Najork 755 to produce a system whereby the 
queues are operated as individual link managers. 
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Applicant argues on pages 9-10 of the Request that the cited references do not teach sorting the hosts in 
each bucket based on stall time. The examiner respectfully disagrees. Najork '265 teaches grouping the plurality 
of links to hyperlinked documents by host, as well as teaching selecting a host to crawl next according to a stall 
time of the host. 



Conclusion 

8. THIS ACTION IS MADE FINAL. Applicant is reminded of the extension of time policy as set forth 
in 37 CFR 1.136(a). 

A shortened statutory period for reply to this final action is set to expire THREE MONTHS from the 
mailing date of this action. In the event a first reply is filed within TWO MONTHS of the mailing date of this 
final action and the advisory action is not mailed until after the end of the THREE-MONTH shortened statutory 
period, then the shortened statutory period will expire on the date the advisory action is mailed, and any 
extension fee pursuant to 37 CFR 1.136(a) will be calculated from the mailing date of the advisory action. In no 
event, however, will the statutory period for reply expire later than SIX MONTHS from the mailing date of this 
final action. 

9. Any inquiry concerning this communication or earlier communications from the examiner should be 
directed to William L. Bashore whose telephone number is (571) 272-4088. The examiner can normally be 
reached on 1 1 :30am - 8:00pm EST. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's supervisor, Heather 
Herndon can be reached on (571) 272-4136. The fax phone number for the organization where this application 
or proceeding is assigned is 571-273-8300. 
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10. Information regarding the status of an application may be obtained from the Patent Application 
Information Retrieval (PAIR) system. Status information for published applications may be obtained from 
either Private PAIR or Public PAIR. Status information for unpublished applications is available through 
Private PAIR only. For more information about the PAIR system, see http://pair-direct.uspto.gov. Should you 
have questions on access to the Private PAIR system, contact the Electronic Business Center (EBC) at 866-217- 
9197 (toll-free). If you would like assistance from a USPTO Customer Service Representative or access to the 
automated information system, call 800-786-9199 (IN USA OR CANADA) or 571-272-1000. 



WILUAM BASHORE 
PRIMARY EXAMINER 



April 11,2007 



